Biodiversity informatics: the challenge of linking data and the role of shared identifiers

نویسنده

  • Roderic D. M. Page
چکیده

A major challenge facing biodiversity informatics is integrating data stored in widely distributed databases. Initial efforts have relied on taxonomic names as the shared identifier linking records in different databases. However, taxonomic names have limitations as identifiers, being neither stable nor globally unique, and the pace of molecular taxonomic and phylogenetic research means that a lot of information in public sequence databases is not linked to formal taxonomic names. This review explores the use of other identifiers, such as specimen codes and GenBank accession numbers, to link otherwise disconnected facts in different databases. The structure of these links can also be exploited using the PageRank algorithm to rank the results of searches on biodiversity databases. The key to rich integration is a commitment to deploy and reuse globally unique, shared identifiers [such as Digital Object Identifiers (DOIs) and Life Science Identifiers (LSIDs)], and the implementation of services that link those identifiers.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Trouble with Triplets in Biodiversity Informatics: A Data-Driven Case against Current Identifier Practices

The biodiversity informatics community has discussed aspirations and approaches for assigning globally unique identifiers (GUIDs) to biocollections for nearly a decade. During that time, and despite misgivings, the de facto standard identifier has become the "Darwin Core Triplet", which is a concatenation of values for institution code, collection code, and catalog number associated with biocol...

متن کامل

The Use of Online Social Networks and Their Role in Sharing Health Information among Pregnant Women of Kerman

Background and Aim: The online social networks as new and widespread sources of information have been able to facilitate the accessibility of people to health information. The aim of this study was to determine the use of online social networks and their role in sharing health information among pregnant women. Materials and Methods: This descriptive-analytical study was conducted in Kerman in ...

متن کامل

Basic Properties of the Persona Model

This document proposes a terminology and a model for representation of user data in information systems in the form of “persona” objects. It provides the mechanisms for evaluation how the personae relate to real-world subjects or to each other. A mechanism how to evaluate some anonymity and identity properties is proposed. This paper also describes the linking of personae by the use of shared i...

متن کامل

Community Next Steps for Making Globally Unique Identifiers Work for Biocollections Data

Biodiversity data is being digitized and made available online at a rapidly increasing rate but current practices typically do not preserve linkages between these data, which impedes interoperation, provenance tracking, and assembly of larger datasets. For data associated with biocollections, the biodiversity community has long recognized that an essential part of establishing and preserving li...

متن کامل

Teacher training students’ experiences of the role of Coach in linking theory and practice in practicum

Future teachers of education are Current students that  focus on the development of teacher knowledge in the field of theory and practice in practicum as an important part of the teacher training program. One of the effective factors in the pre-service programs is schools coach, The study seeks it. The reserch method used in this study was qualitative method and was of phenomenological type. An...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Briefings in bioinformatics

دوره 9 5  شماره 

صفحات  -

تاریخ انتشار 2008